[AMDGPU] Copy WaveSizePredicate from FLAT_Pseudo to VFLAT_Real #151002

changpeng · 2025-07-28T17:17:11Z

Instructions like load transpose use this predicate.

llvmbot · 2025-07-28T17:17:42Z

@llvm/pr-subscribers-backend-amdgpu

Author: Changpeng Fang (changpeng)

Changes

Instructions like load transpose use this predicate.

Full diff: https://github.com/llvm/llvm-project/pull/151002.diff

1 Files Affected:

(modified) llvm/lib/Target/AMDGPU/FLATInstructions.td (+1)

diff --git a/llvm/lib/Target/AMDGPU/FLATInstructions.td b/llvm/lib/Target/AMDGPU/FLATInstructions.td
index 7207c251994ad..65878b4796a6a 100644
--- a/llvm/lib/Target/AMDGPU/FLATInstructions.td
+++ b/llvm/lib/Target/AMDGPU/FLATInstructions.td
@@ -168,6 +168,7 @@ class VFLAT_Real <bits<8> op, FLAT_Pseudo ps, string opName = ps.Mnemonic> :
   let WaveSizePredicate    = ps.WaveSizePredicate;
   let AsmMatchConverter    = ps.AsmMatchConverter;
   let OtherPredicates      = ps.OtherPredicates;
+  let WaveSizePredicate    = ps.WaveSizePredicate;
   let TSFlags              = ps.TSFlags;
   let UseNamedOperandTable = ps.UseNamedOperandTable;
   let SchedRW              = ps.SchedRW;

shiltian · 2025-07-28T17:35:34Z

and there is no test case needed for this one?

changpeng · 2025-07-28T18:17:35Z

> and there is no test case needed for this one?

Right, there is no test needed at this moment.

For gfx1200, the opcode is supported with both wave sizes, so the assembler only reports an invalid-operand error:
global_load_tr_b128 v[1:4], v0, s[0:1] offset:-64
// W64-ERR: :[[@LINE-1]]:{{[0-9]+}}: error: operands are not valid for this GPU or mode
// W32: encoding: [0x00,0xc0,0x15,0xee,0x01,0x00,0x00,0x00,0x00,0xc0,0xff,0xff]

For gfx1250, it already reports a wavesize error:
global_load_tr8_b64 v[2:3], v0, s[0:1]
// GFX1250: global_load_tr8_b64 v[2:3], v0, s[0:1] ; encoding: [0x00,0x00,0x16,0xee,0x02,0x00,0x00,0x00,0x00,0x00,0x00,0x00]
// WAVESIZE-ERR: :[[@LINE-2]]:{{[0-9]+}}: error: instruction requires wavesize=32

shiltian · 2025-07-28T18:52:16Z (edited by changpeng)

but in that case why do we need this then?

[changpeng] You are right. VFLAT_Real already copies it, so this change is not needed. My intention was to add the copy to the FLAT_Real class to stay in sync with the downstream branch. Thanks for the question. I am going to abandon this PR.
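For readers unfamiliar with the pseudo/real split discussed above, the following is a minimal TableGen sketch of how a "real" class copies a predicate field from its pseudo. All names here (Demo_Pseudo, Demo_Real, DEMO_LOAD_TR, the simplified Predicate class) are hypothetical stand-ins for illustration and are not the actual definitions in FLATInstructions.td.

```tablegen
// Hypothetical, simplified stand-ins; not the real AMDGPU definitions.
class Predicate<string cond> {
  string CondString = cond;
}
def TruePredicate : Predicate<"">;
def isWave32      : Predicate<"Subtarget->isWave32()">;

// The pseudo carries the target-independent description of the instruction.
class Demo_Pseudo<string mnemonic> {
  string Mnemonic = mnemonic;
  Predicate WaveSizePredicate = TruePredicate;
  list<Predicate> OtherPredicates = [];
}

// The real (encoded) instruction copies its predicates from the pseudo `ps`,
// so the assembler sees the same restrictions as instruction selection.
class Demo_Real<bits<8> op, Demo_Pseudo ps> {
  bits<8> Opcode = op;
  string Mnemonic = ps.Mnemonic;
  Predicate WaveSizePredicate = ps.WaveSizePredicate;
  list<Predicate> OtherPredicates = ps.OtherPredicates;
}

// A wave32-only pseudo, loosely modeled on the load-transpose case.
def DEMO_LOAD_TR : Demo_Pseudo<"demo_load_tr"> {
  let WaveSizePredicate = isWave32;
}

// The real record inherits the wave32 restriction through the copy above.
def DEMO_LOAD_TR_real : Demo_Real<0x16, DEMO_LOAD_TR>;
```

Under these assumptions, DEMO_LOAD_TR_real ends up with WaveSizePredicate set to isWave32 through a single copy, which is why a second copy of the same field in the same class adds nothing.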

Commit 4f0f8e9: [AMDGPU] Copy WaveSizePredicate from FLAT_Pseudo to VFLAT_Real

Instructions like load transpose use this predicate.

llvmbot added the backend:AMDGPU label Jul 28, 2025

changpeng requested review from rampitec and shiltian July 28, 2025 17:17

changpeng closed this Jul 28, 2025
